Model-Based Reinforcement Learning Method for Microgrid Optimization Scheduling

نویسندگان

چکیده

Due to the uncertainty and randomness of clean energy, microgrid operation is often prone instability, which requires implementation a robust adaptive optimization scheduling method. In this paper, model-based reinforcement learning algorithm applied optimal problem microgrids. During training process, current learned networks are used assist Monte Carlo Tree Search (MCTS) in completing game history accumulation, updating network parameters obtain strategies simulated environmental dynamics model. We establish environment simulator that includes Heating Ventilation Air Conditioning (HVAC) systems, Photovoltaic (PV) Energy Storage (ES) systems for simulation. The simulation results show microgrids both islanded connected modes does not affect effectiveness algorithm. After 200 steps, can avoid punishment exceeding red line bus voltage, after 800 result converges loss values value reward converge 0, showing good effectiveness. This proves proposed paper be

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm

: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...

متن کامل

An integrated approach for scheduling flexible job-shop using teaching–learning-based optimization method

In this paper, teaching–learning-based optimization (TLBO) is proposed to solve flexible job shop scheduling problem (FJSP) based on the integrated approach with an objective to minimize makespan. An FJSP is an extension of basic job-shop scheduling problem. There are two sub problems in FJSP. They are routing problem and sequencing problem. If both the sub problems are solved simultaneously, t...

متن کامل

Model-Free Trajectory Optimization for Reinforcement Learning

Many of the recent Trajectory Optimization algorithms alternate between local approximation of the dynamics and conservative policy update. However, linearly approximating the dynamics in order to derive the new policy can bias the update and prevent convergence to the optimal policy. In this article, we propose a new model-free algorithm that backpropagates a local quadratic time-dependent Q-F...

متن کامل

Reinforcement Learning: Model-based

Reinforcement learning (RL) refers to a wide range of dierent learning algorithms for improving a behavioral policy on the basis of numerical reward signals that serve as feedback. In its basic form, reinforcement learning bears striking resemblance to ‘operant conditioning’ in psychology and animal learning: actions that are rewarded tend to occur more frequently; actions that are punished ar...

متن کامل

Model Based Reinforcement Learning with Final Time Horizon Optimization

We present one of the first algorithms on model based reinforcement learning and trajectory optimization with free final time horizon. Grounded on the optimal control theory and Dynamic Programming, we derive a set of backward differential equations that propagate the value function and provide the optimal control policy and the optimal time horizon. The resulting policy generalizes previous re...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Sustainability

سال: 2023

ISSN: ['2071-1050']

DOI: https://doi.org/10.3390/su15129235